Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Database (Oxford) ; 20232023 03 04.
Artigo em Inglês | MEDLINE | ID: mdl-36869839

RESUMO

Long non-coding ribonucleic acids (lncRNAs) account for the largest group of non-coding RNAs. However, knowledge about their function and regulation is limited. lncHUB2 is a web server database that provides known and inferred knowledge about the function of 18 705 human and 11 274 mouse lncRNAs. lncHUB2 produces reports that contain the secondary structure fold of the lncRNA, related publications, the most correlated coding genes, the most correlated lncRNAs, a network that visualizes the most correlated genes, predicted mouse phenotypes, predicted membership in biological processes and pathways, predicted upstream transcription factor regulators, and predicted disease associations. In addition, the reports include subcellular localization information; expression across tissues, cell types, and cell lines, and predicted small molecules and CRISPR knockout (CRISPR-KO) genes prioritized based on their likelihood to up- or downregulate the expression of the lncRNA. Overall, lncHUB2 is a database with rich information about human and mouse lncRNAs and as such it can facilitate hypothesis generation for many future studies. The lncHUB2 database is available at https://maayanlab.cloud/lncHUB2. Database URL: https://maayanlab.cloud/lncHUB2.


Assuntos
RNA Longo não Codificante , Humanos , Animais , Camundongos , Linhagem Celular , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Bases de Dados Factuais , Conhecimento
2.
BMC Bioinformatics ; 23(1): 374, 2022 Sep 13.
Artigo em Inglês | MEDLINE | ID: mdl-36100892

RESUMO

The L1000 technology, a cost-effective high-throughput transcriptomics technology, has been applied to profile a collection of human cell lines for their gene expression response to > 30,000 chemical and genetic perturbations. In total, there are currently over 3 million available L1000 profiles. Such a dataset is invaluable for the discovery of drug and target candidates and for inferring mechanisms of action for small molecules. The L1000 assay only measures the mRNA expression of 978 landmark genes while 11,350 additional genes are computationally reliably inferred. The lack of full genome coverage limits knowledge discovery for half of the human protein coding genes, and the potential for integration with other transcriptomics profiling data. Here we present a Deep Learning two-step model that transforms L1000 profiles to RNA-seq-like profiles. The input to the model are the measured 978 landmark genes while the output is a vector of 23,614 RNA-seq-like gene expression profiles. The model first transforms the landmark genes into RNA-seq-like 978 gene profiles using a modified CycleGAN model applied to unpaired data. The transformed 978 RNA-seq-like landmark genes are then extrapolated into the full genome space with a fully connected neural network model. The two-step model achieves 0.914 Pearson's correlation coefficients and 1.167 root mean square errors when tested on a published paired L1000/RNA-seq dataset produced by the LINCS and GTEx programs. The processed RNA-seq-like profiles are made available for download, signature search, and gene centric reverse search with unique case studies.


Assuntos
Aprendizado Profundo , Perfilação da Expressão Gênica , Humanos , RNA-Seq , Transcriptoma
3.
Curr Protoc ; 2(7): e487, 2022 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-35876555

RESUMO

The Library of Integrated Network-based Cellular Signatures (LINCS) was an NIH Common Fund program that aimed to expand our knowledge about human cellular responses to chemical, genetic, and microenvironment perturbations. Responses to perturbations were measured by transcriptomics, proteomics, cellular imaging, and other high content assays. The second phase of the LINCS program, which lasted 7 years, involved the engagement of six data and signature generation centers (DSGCs) and one data coordination and integration center (DCIC). The DSGCs and the DCIC developed several digital resources, including tools, databases, and workflows that aim to facilitate the use of the LINCS data and integrate this data with other publicly available data. The digital resources developed by the DSGCs and the DCIC can be used to gain new biological and pharmacological insights that can lead to the development of novel therapeutics. This protocol provides step-by-step instructions for processing the LINCS data into signatures, and utilizing the digital resources developed by the LINCS consortia for hypothesis generation and knowledge discovery. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Navigating L1000 tools and data in CLUE.io Basic Protocol 2: Computing signatures from the L1000 data with the CD method Basic Protocol 3: Analyzing lists of differentially expressed genes and querying them against the L1000 data with BioJupies and the Bulk RNA-seq Appyter Basic Protocol 4: Utilizing the L1000FWD resource for drug discovery Basic Protocol 5: KINOMEscan and the KINOMEscan Appyter Basic Protocol 6: LINCS P100 and GCP Proteomics Assays Basic Protocol 7: The LINCS Joint Project (LJP) Basic Protocol 8: The LINCS Data Portals and SigCom LINCS Basic Protocol 9: Creating and analyzing signatures with iLINCS.


Assuntos
Descoberta de Drogas , Proteômica , Bases de Dados Factuais , Descoberta de Drogas/métodos , Biblioteca Gênica , Humanos , Transcriptoma
4.
Nucleic Acids Res ; 50(W1): W697-W709, 2022 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-35524556

RESUMO

Millions of transcriptome samples were generated by the Library of Integrated Network-based Cellular Signatures (LINCS) program. When these data are processed into searchable signatures along with signatures extracted from Genotype-Tissue Expression (GTEx) and Gene Expression Omnibus (GEO), connections between drugs, genes, pathways and diseases can be illuminated. SigCom LINCS is a webserver that serves over a million gene expression signatures processed, analyzed, and visualized from LINCS, GTEx, and GEO. SigCom LINCS is built with Signature Commons, a cloud-agnostic skeleton Data Commons with a focus on serving searchable signatures. SigCom LINCS provides a rapid signature similarity search for mimickers and reversers given sets of up and down genes, a gene set, a single gene, or any search term. Additionally, users of SigCom LINCS can perform a metadata search to find and analyze subsets of signatures and find information about genes and drugs. SigCom LINCS is findable, accessible, interoperable, and reusable (FAIR) with metadata linked to standard ontologies and vocabularies. In addition, all the data and signatures within SigCom LINCS are available via a well-documented API. In summary, SigCom LINCS, available at https://maayanlab.cloud/sigcom-lincs, is a rich webserver resource for accelerating drug and target discovery in systems pharmacology.


Assuntos
Metadados , Transcriptoma , Transcriptoma/genética , Ferramenta de Busca
5.
Front Immunol ; 12: 636289, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33763080

RESUMO

Although widely prevalent, Lyme disease is still under-diagnosed and misunderstood. Here we followed 73 acute Lyme disease patients and uninfected controls over a period of a year. At each visit, RNA-sequencing was applied to profile patients' peripheral blood mononuclear cells in addition to extensive clinical phenotyping. Based on the projection of the RNA-seq data into lower dimensions, we observe that the cases are separated from controls, and almost all cases never return to cluster with the controls over time. Enrichment analysis of the differentially expressed genes between clusters identifies up-regulation of immune response genes. This observation is also supported by deconvolution analysis to identify the changes in cell type composition due to Lyme disease infection. Importantly, we developed several machine learning classifiers that attempt to perform various Lyme disease classifications. We show that Lyme patients can be distinguished from the controls as well as from COVID-19 patients, but classification was not successful in distinguishing those patients with early Lyme disease cases that would advance to develop post-treatment persistent symptoms.


Assuntos
Leucócitos Mononucleares/imunologia , Doença de Lyme/genética , Adulto , COVID-19/genética , COVID-19/imunologia , Citocinas/genética , Citocinas/imunologia , Feminino , Seguimentos , Humanos , Leucócitos Mononucleares/química , Doença de Lyme/sangue , Doença de Lyme/imunologia , Aprendizado de Máquina , Masculino , Pessoa de Meia-Idade , Estudos Prospectivos , RNA-Seq
6.
Curr Protoc ; 1(3): e90, 2021 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-33780170

RESUMO

Profiling samples from patients, tissues, and cells with genomics, transcriptomics, epigenomics, proteomics, and metabolomics ultimately produces lists of genes and proteins that need to be further analyzed and integrated in the context of known biology. Enrichr (Chen et al., 2013; Kuleshov et al., 2016) is a gene set search engine that enables the querying of hundreds of thousands of annotated gene sets. Enrichr uniquely integrates knowledge from many high-profile projects to provide synthesized information about mammalian genes and gene sets. The platform provides various methods to compute gene set enrichment, and the results are visualized in several interactive ways. This protocol provides a summary of the key features of Enrichr, which include using Enrichr programmatically and embedding an Enrichr button on any website. © 2021 Wiley Periodicals LLC. Basic Protocol 1: Analyzing lists of differentially expressed genes from transcriptomics, proteomics and phosphoproteomics, GWAS studies, or other experimental studies Basic Protocol 2: Searching Enrichr by a single gene or key search term Basic Protocol 3: Preparing raw or processed RNA-seq data through BioJupies in preparation for Enrichr analysis Basic Protocol 4: Analyzing gene sets for model organisms using modEnrichr Basic Protocol 5: Using Enrichr in Geneshot Basic Protocol 6: Using Enrichr in ARCHS4 Basic Protocol 7: Using the enrichment analysis visualization Appyter to visualize Enrichr results Basic Protocol 8: Using the Enrichr API Basic Protocol 9: Adding an Enrichr button to a website.


Assuntos
Descoberta do Conhecimento , Software , Animais , Biologia Computacional , Genômica , Humanos , RNA-Seq
7.
Patterns (N Y) ; 2(3): 100213, 2021 Mar 12.
Artigo em Inglês | MEDLINE | ID: mdl-33748796

RESUMO

Jupyter Notebooks have transformed the communication of data analysis pipelines by facilitating a modular structure that brings together code, markdown text, and interactive visualizations. Here, we extended Jupyter Notebooks to broaden their accessibility with Appyters. Appyters turn Jupyter Notebooks into fully functional standalone web-based bioinformatics applications. Appyters present to users an entry form enabling them to upload their data and set various parameters for a multitude of data analysis workflows. Once the form is filled, the Appyter executes the corresponding notebook in the cloud, producing the output without requiring the user to interact directly with the code. Appyters were used to create many bioinformatics web-based reusable workflows, including applications to build customized machine learning pipelines, analyze omics data, and produce publishable figures. These Appyters are served in the Appyters Catalog at https://appyters.maayanlab.cloud. In summary, Appyters enable the rapid development of interactive web-based bioinformatics applications.

8.
Diabetes ; 70(2): 589-602, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33067313

RESUMO

Diabetic kidney disease (DKD) remains the most common cause of kidney failure, and the treatment options are insufficient. Here, we used a connectivity mapping approach to first collect 15 gene expression signatures from 11 DKD-related published independent studies. Then, by querying the Library of Integrated Network-based Cellular Signatures (LINCS) L1000 data set, we identified drugs and other bioactive small molecules that are predicted to reverse these gene signatures in the diabetic kidney. Among the top consensus candidates, we selected a PLK1 inhibitor (BI-2536) for further experimental validation. We found that PLK1 expression was increased in the glomeruli of both human and mouse diabetic kidneys and localized largely in mesangial cells. We also found that BI-2536 inhibited mesangial cell proliferation and extracellular matrix in vitro and ameliorated proteinuria and kidney injury in DKD mice. Further pathway analysis of the genes predicted to be reversed by the PLK1 inhibitor was of members of the TNF-α/NF-κB, JAK/STAT, and TGF-ß/Smad3 pathways. In vitro, either BI-2536 treatment or knockdown of PLK1 dampened the NF-κB and Smad3 signal transduction and transcriptional activation. Together, these results suggest that the PLK1 inhibitor BI-2536 should be further investigated as a novel therapy for DKD.


Assuntos
Diabetes Mellitus , Nefropatias Diabéticas , Preparações Farmacêuticas , Animais , Nefropatias Diabéticas/tratamento farmacológico , Rim , Camundongos , Pteridinas , Transcriptoma
9.
Glia ; 68(10): 2148-2166, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-32639068

RESUMO

Glioblastoma (GBM) is the most aggressive primary brain tumor. In addition to being genetically heterogeneous, GBMs are also immunologically heterogeneous. However, whether the differences in immune microenvironment are driven by genetic driver mutation is unexplored. By leveraging the versatile RCAS/tv-a somatic gene transfer system, we establish a mouse model for Classical GBM by introducing EGFRvIII expression in Nestin-positive neural stem/progenitor cells in adult mice. Along with our previously published Nf1-silenced and PDGFB-overexpressing models, we investigate the immune microenvironments of the three models of human GBM subtypes by unbiased multiplex profiling. We demonstrate that both the quantity and composition of the microenvironmental myeloid cells are dictated by the genetic driver mutations, closely mimicking what was observed in human GBM subtypes. These myeloid cells express high levels of the immune checkpoint protein PD-L1; however, PD-L1 targeted therapies alone or in combination with irradiation are unable to increase the survival time of tumor-bearing mice regardless of the driver mutations, reflecting the outcomes of recent human trials. Together, these results highlight the critical utility of immunocompetent mouse models for preclinical studies of GBM, making these models indispensable tools for understanding the resistance mechanisms of immune checkpoint blockade in GBM and immune cell-targeting drug discovery.


Assuntos
Neoplasias Encefálicas/genética , Neoplasias Encefálicas/imunologia , Glioblastoma/genética , Glioblastoma/imunologia , Inibidores de Checkpoint Imunológico/uso terapêutico , Mutação/fisiologia , Animais , Neoplasias Encefálicas/tratamento farmacológico , Neoplasias Encefálicas/patologia , Feminino , Glioblastoma/tratamento farmacológico , Glioblastoma/patologia , Humanos , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Transgênicos , Células Tumorais Cultivadas
11.
Cell Syst ; 9(5): 417-421, 2019 11 27.
Artigo em Inglês | MEDLINE | ID: mdl-31677972

RESUMO

As more digital resources are produced by the research community, it is becoming increasingly important to harmonize and organize them for synergistic utilization. The findable, accessible, interoperable, and reusable (FAIR) guiding principles have prompted many stakeholders to consider strategies for tackling this challenge. The FAIRshake toolkit was developed to enable the establishment of community-driven FAIR metrics and rubrics paired with manual and automated FAIR assessments. FAIR assessments are visualized as an insignia that can be embedded within digital-resources-hosting websites. Using FAIRshake, a variety of biomedical digital resources were manually and automatically evaluated for their level of FAIRness.


Assuntos
Disseminação de Informação/métodos , Internet/tendências , Sistemas On-Line/normas , Recursos em Saúde/normas , Humanos
12.
Nucleic Acids Res ; 47(W1): W571-W577, 2019 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-31114885

RESUMO

The frequency by which genes are studied correlates with the prior knowledge accumulated about them. This leads to an imbalance in research attention where some genes are highly investigated while others are ignored. Geneshot is a search engine developed to illuminate this gap and to promote attention to the under-studied genome. Through a simple web interface, Geneshot enables researchers to enter arbitrary search terms, to receive ranked lists of genes relevant to the search terms. Returned ranked gene lists contain genes that were previously published in association with the search terms, as well as genes predicted to be associated with the terms based on data integration from multiple sources. The search results are presented with interactive visualizations. To predict gene function, Geneshot utilizes gene-gene similarity matrices from processed RNA-seq data, or from gene-gene co-occurrence data obtained from multiple sources. In addition, Geneshot can be used to analyze the novelty of gene sets and augment gene sets with additional relevant genes. The Geneshot web-server and API are freely and openly available from https://amp.pharm.mssm.edu/geneshot.


Assuntos
Genes , Software , Mineração de Dados , Expressão Gênica , Internet , Publicações , RNA-Seq , Pesquisadores , Interface Usuário-Computador
13.
Nucleic Acids Res ; 47(W1): W212-W224, 2019 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-31114921

RESUMO

Identifying the transcription factors (TFs) responsible for observed changes in gene expression is an important step in understanding gene regulatory networks. ChIP-X Enrichment Analysis 3 (ChEA3) is a transcription factor enrichment analysis tool that ranks TFs associated with user-submitted gene sets. The ChEA3 background database contains a collection of gene set libraries generated from multiple sources including TF-gene co-expression from RNA-seq studies, TF-target associations from ChIP-seq experiments, and TF-gene co-occurrence computed from crowd-submitted gene lists. Enrichment results from these distinct sources are integrated to generate a composite rank that improves the prediction of the correct upstream TF compared to ranks produced by individual libraries. We compare ChEA3 with existing TF prediction tools and show that ChEA3 performs better. By integrating the ChEA3 libraries, we illuminate general transcription factor properties such as whether the TF behaves as an activator or a repressor. The ChEA3 web-server is available from https://amp.pharm.mssm.edu/ChEA3.


Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Biblioteca Gênica , Fatores de Transcrição/genética , Sequenciamento de Cromatina por Imunoprecipitação/métodos , Conjuntos de Dados como Assunto , Regulação da Expressão Gênica/genética , Redes Reguladoras de Genes/genética , Humanos
14.
Nucleic Acids Res ; 47(W1): W183-W190, 2019 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-31069376

RESUMO

High-throughput experiments produce increasingly large datasets that are difficult to analyze and integrate. While most data integration approaches focus on aligning metadata, data integration can be achieved by abstracting experimental results into gene sets. Such gene sets can be made available for reuse through gene set enrichment analysis tools such as Enrichr. Enrichr currently only supports gene sets compiled from human and mouse, limiting accessibility for investigators that study other model organisms. modEnrichr is an expansion of Enrichr for four model organisms: fish, fly, worm and yeast. The gene set libraries within FishEnrichr, FlyEnrichr, WormEnrichr and YeastEnrichr are created from the Gene Ontology, mRNA expression profiles, GeneRIF, pathway databases, protein domain databases and other organism-specific resources. Additionally, libraries were created by predicting gene function from RNA-seq co-expression data processed uniformly from the gene expression omnibus for each organism. The modEnrichr suite of tools provides the ability to convert gene lists across species using an ortholog conversion tool that automatically detects the species. For complex analyses, modEnrichr provides API access that enables submitting batch queries. In summary, modEnrichr leverages existing model organism databases and other resources to facilitate comprehensive hypothesis generation through data integration.


Assuntos
Bases de Dados Genéticas , Expressão Gênica/genética , Biblioteca Gênica , Biblioteca Genômica , Software , Animais , Biologia Computacional , Ontologia Genética , Humanos , Metadados
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...